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THE USE OF A PROGRAMMED COMPUTER TO PERFORM REAL-TIME 

COMPANDING OF HIGH-QUALITY SOUND SIGNALS 

G.W. McNaily, B.Sc. 



1. Introduction 

It is economically desirable to reduce the bit rate of a 
digitally coded sound signal so as to maximise the pro- 
gramme-carrying capacity of a communications network. 
One method of achieving this is to reduce the number of 
bits transmitted per sample by a companding technique. 
Several techniques have been proposed and it is highly 
desirable that a common standard should be reached. 

Until recently, the only way to assess these compan- 
dors was by simulation, or by the construction of special- 
purpose hardware. Both of these methods are inflexible 
in terms of the changing and optimising of parameters of 
the companding system. However it is shown in this Report 
that a small computer which possesses the required speed 
and performance can be used to execute these companding 
operations in real time. Using this approach, companding 
systems (and many other audio-signal processing systems) 
can be assessed quickly with complete control over the 
variables of the system. 

This Report describes how programs were developed 
to compare six companding systems which are currently 
under examination for possible use in national and inter- 
national broadcasting chains. 



2. Companding in digital audio systems 

Companding is a technique for improving the signal-to- 
noise ratio in sound-signal processing recording and trans- 
mission systems. It can also be used in a p.c.m. system to 
make more efficient use of the available number of quan- 
tising levels, for example to reduce the bit-rate required for 
transmitting a signal. Such companding techniques have 
been investigated as possible means for making optimum 
use, for high-quality sound-signals, of the 2048 kbit/s level 
in the digital transmission hierarchy. The background to 

2 T 4 

these techniques is well documented ' ' and only a brief 
description will be given here. 

In instantaneous companding, each digital word, 
representing one sound sample, is compressed or 'mapped' 
into another digital word containing fewer bits, according 
to a compression law, with a transfer characteristic that is 
segmented and piece-wise linear (see Section 2.1.1). This 
law essentially has small intervals between quantising levels 
for signals of small amplitude and large intervals for signals 
of high amplitude. By these means the amount of quan- 
tising noise imposed on small signals can be reduced in 
exchange for imposing more noise on large signals which 
tend to mask the noise, and a subsequent reduction in the 
number of bits transmitted for each sample can be obtained. 
As most of the compressed output words must correspond 
to one or more input words, the mapping is 'many-to-one'; 
therefore the process is irreversible and information is des- 
troyed. 



To decode this compressed signal to its original form 
requires an expander, which exactly inverts the process of 
the compressor. In digital systems this is easy to achieve 
but, as the overall process Is irreversible, impairments are 
introduced, e.g. the signal-to-noise ratio varies with the 
signal amplitude and may be audible as programme- 
modulated quantising noise. 

A second method which may be used is the use of a 
companding law in the 'near-instantaneous' mode. In 
this system, a block of say 32 successive samples is coded 
with an accuracy that is determined by the largest sample 
value in the block; the larger this sample, the lower the 
accuracy. A scale-factor word indicating the quantising 
accuracy is transmitted once per block, and almost the 
entire number of quantising levels is available for each 
segment of the law (see Section 2.1.2). The process is 
again essentially irreversible, but high-level signals are 
coded to a higher degree of accuracy than they are with 
instantaneous companding. 

In the compandors outlined below, the action is 
represented by the notation iJV^, N2.N3), where Nl is the 
number of bits per sample before compression, N2 the 
number of bits per sample transmitted after compression, 
and N3 is the effective number of bits per sample after 
expansion. 

2.1. Companding laws 

2.1.1. A-law instantaneous compandor (14-10—14) 

The A-law was designed to maintain a near-constant 
signal-to-quantising-noise ratio over a wide range of input 
signal levels in telephony.'^ Companding laws are skew- 
symmetrical about the origin and Fig. 1 shows the positive 
quadrant of the A-law. If, at the origin, the slope repre- 
sents linear coding over the whole of the input signal range 
with n bits/sample, then the slope of the seventh segment 
represents quantisation with n - 6 bits/sample. The total 
number of levels required for transmission using the A-law 
is 2"""'* requiring « - 4 bits/sample. 

For high-quality sound-signals, the input to the com- 
pressor is linearly encoded with 14 bits/sample. The 
slopes of the segments are related by factors of 2, and the 
relationship between the 10 transmitted bits/sample and 
the output of 14 bits/sample is illustrated in Table 1, 
which shows the complete compression and expansion 
process in numerical form, and the use of rounding to 
minimise errors caused by necessary truncation. The 
truncation of least significant bits (LSBs) in the compressor 
means that the maximum error produced could be as large 
as one LSB of the transmitted word. By adding '^LSB 
before restoring the correct level of significance in the 
expander, the maximum error is reduced by half. 
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Fig. 1 - A-law characteristic 







2.1.2. BBC NICAM 1 (13-10-13) 

This system uses a near-instantaneous compandor 
devised by the BBC Research and Designs Departments for 
use in a Near-Instantaneous Compandor And Multiplex 
system (NICAM); it uses a four segment law,"^ and permits 
six high-quality sound channels in a 2048 kbit/s multiplex. 
The use of a near-instantaneous compandor gives the advan- 
tage that only a small number of bits are required for the 
scale-factor word to denote the operative segments and 



almost the entire number of quantising levels is available 
for each segment of the law. 

The input to the compressor is a 13 bits/sample 
linearly-encoded sound-signal, with an added 'dither' signal 
to mask distortion of low-level signals. The compressor 
can be used in one of four ranges, the range to be used for 
any given block of 32 samples being determined by the 
magnitude of the largest sample (designated X) in that 
block. This is illustrated in Fig. 2 and Table 2. 
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Fig. 2 - Characteristics followed with NICAM 1 near-instantaneous companding (ignoring scale factor word) 



The output from the compressor consists of blocks 
of 32, 10-bit words with a scale factor of 2 bits. For small 
amplitude signals, i.e. range 4,* full 13-bit accuracy is 
obtained even though only 10 bits are transmitted, as the 
three most significant bits are known to be zero, while for 
large signals, i.e. range 1, the three least significant bits are 



* Note that the literature on near-instantaneous companding assigns 
the higher numbered ranges to lower-level signals whereas the 
opposite sense Is applied to A-law companding. 



truncated so that only 10 bit accuracy is obtained. Expand- 
ing follows the reverse procedure except that after expan- 
sion, rounding is used to minimise the error in the same 
way described for the A-law compandor. 

2.1.3. TDF system 1 (14-9y2-13y2) 

This near-instantaneous companding system was 
proposed by Telediffusion de France so that the overall bit 
rate of one coded sound channel would be 320 kbit/s. 
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* 54 in the case of the TDF 9-bit system (Section 2.1.4). 

qlv is the ratio of the value of the quantising step used to code the block of samples, to the maximum signal value at the 

coder input 
N is the number of bits that would be required in a linear p.c.m. system if this range were used to code the whole 

range of the input signal. 



precisely equivalent in bit-rate to five telephone channels, 
each at the CCITT bit-rate of 64 kbit/s. 

Each sample is linearly encoded with 14 bits/sample, 
and each block of 32, 14-bit words is then transformed 
into a block of 32, 10-bit words using a five-segment com- 
pression law, and the scale factor is coded with 3 bits. 
Parity, scale-factor, and synchronisation bits are accom- 
modated for transmission by suppressing the least-signifi- 
cant bit in every second word thereby releasing 16 kbit/s 
for this purpose. At the receiving terminal, the inverse 
process is performed, and the untransmitted least-signifi- 
cant bits are systematically made equal to zero in the 
process of digital-to-analogue conversion in the decoder. 

The allocation of parity bits etc. is described in 
detail in unpublished committee papers but, fortunately, 
for assessing the subjective performance of the system, this 
information is not required when writing the computer 
program. The precise nature of the compression law is 
given in Table 2. 

2.1.4. TDF system 2 (14~9--14) 

This is a modified version of the system described 
in Section 2.1.3; extra bits are released for synchronisation 
of the serialised data stream. 

Each sample is linearly encoded with 14 bits/sample, 
and then each block of 54, 14-bit words is transformed 
into a block of 54, 9-bit words using a six-segment com- 
pression law, and the scale factor is coded with 3 bits. 
Thus the overall data rate is 288 kbit/s releasing 32 kbit/s 
for the scale factor, error correction, signalling, and im- 
proved synchronisation. 

At the receiver the inverse process is carried out, but 
in contrast to TDF system 1, there is no impairment of low- 
level signals due to untransmitted least-significant bits. 



2.1.5. BBC NICAM 2 (14-11-14) 

This is a near-instantaneous companding system 
similar to NICAM 1, but giving lower programme-modu- 
lated noise and idle-channel noise, and permitting five high- 
quality sound channels in a 2048 kbit/s multiplex. 

Each sample is linearly encoded with 14 bits/sample 
and no dither is added in view of the additional bit/sample 
relative to NICAM 1. Each block of 32, 14-bit words is 
transformed into a block of 32, 11 -bit words using a four- 
segment compression law, and the scale factor is coded with 
2 bits. Expansion follows the reverse process, except that, 
after expansion, rounding is used to minimise the error. 

Table 2 gives the details of the compression law. 

2.1.6. BBC NICAM 3 (14-10-14) 

This is a near-instantaneous companding system 
which is similar to NICAM 2, except that it provides an 
extra 32 kbit/s for improved error protection purposes, in 
exchange for additional (but acceptable) programme-modu- 
lated noise equivalent to that for NICAM 1. 



3. The software approach 

The companding techniques described in the previous 
paragraphs can all be implemented using a fast digital pro- 
cessor under the control of a stored program. There are 
several advantages in this kind of approach. Firstly, using 
a single piece of hardware made flexible through software, 
reduces the time taken in the design and development of 
dedicated equipment; secondly, changes in the system can 
be readily achieved by altering the stored program in con- 
trast to making circuit changes; and finally the equipment 
can have many other applications in signal processing and 
process control. 
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3.1. The basic equipment 

The heart of the equipment is a commercial 'mini- 
processor' namely a Plessey Miproc. This consists of a 
high-speed central processing unit (CPU) with memory 
space for Ik* instructions and 0-5k data words. The word 
length is 16 bits and the circuits use Schottky TTL tech- 
nology on two printed circuit cards (235 mm x 160 mm). 
A third 'housekeeping' card controls a display, a keyboard 
and a paper-tape reader. Programs prepared on paper tape 
may be loaded via the reader and stored in the program and 
data memories which consist of random access memories. 
This latter feature is useful when different programs are 
being tested and minor changes are frequently made. 

Input and output (I/O) arrangements have been added 
so that serialised digital sound data can be interfaced 
directly with the processor. To improve the performance 
of the processor, a hardware multiplier has been incor- 
porated in preference to the slower method of programming 
the processor to multiply by a sequence of instructions; by 



this means a 16 x 16 bit multiplication can be executed by 
just three instructions. 

The architecture of the machine is such that most 
instructions (ADD, SHIFT, etc.) can be performed in a 
single clock cycle of 350 ns. For digital sound data 
sampled at 32 kHz, this permits 89 instructions to be 
executed between successive samples. With careful pro- 
gramming, this is sufficient for a large number of signal 
processing applications to be investigated. 

The equipment is depicted in Fig. 3, and it is described 
more fully elsewhere. 

3.2. Program development 

There are three stages in producing a program in a 
suitable form to load into the computer: — 

(i) Expression of the problem in flow-diagram form. 



* k represents 1024. 



(ii) Translation of the flow diagram into a sequence of 
mnemonic statements, termed a 'source program'. 
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Fig. 3 - The computer and associated equipment 
_6- 



(Mi) Assembly of the source program as an 'object code' 
with a binary format. 

The first stage in this process requires no further 
elaboration. The reasons for the second stage are as 
follows. The flow diagram will contain numerous jumps 
and conditional branches, etc., to instructions which are 
out of the normal sequence. If the program were to be 
written directly in the object code, the jump and branch 
destinations would have to be known as absolute memory 
locations. Once these locations have been specified, any 
alteration to the program is likely to be very complicated 
since these locations will have to be respecified with an 
attendant high risk of error. 

It is therefore useful to write a source program of 
mnemonic statements which permit the use of labels and 
symbols to be used for register and memory locations. 
Using readily available computer software the source pro- 
gram can be easily edited or changed. When the source 
program is complete, a 'cross-assembler' is used to translate 
it into the object code by replacing each mnemonic with 
appropriate binary codes. It is possible to write simple 
programs directly in object code, but assembler programs 
usually have routines for dumping the object code auto- 
matically on to a suitable medium, e.g. paper tape, and for 
providing listings as a written record; further, many can 
provide error listings and 'debugging' instructions to track 
down faults. 

In the case of the companding programs the software 
for editing and assembly was accessed via a time-shared 
computer bureau. A program to simulate the Miproc was 
also available, but this was not used since the programs 
could be readily tested on the actual mini-processor in real 
time. 



The aim is to assess the effects on signal quality that 
the companding systems may introduce and hence it is 
important that switching between the systems can be done 
easily. Further, various parameters, such as block size in 
the near-instantaneous systems, are easily altered so that 
their particular effects can be studied. Since it is the com- 
panding action that is of interest, the programs do not con- 
trive to serialize the data stream or insert the scale factor, 
etc., and the technique used is simply to compress the 
digital words according to the selected law and then expand 
them back again. 

The programs run in real time and because of the 
stringent speed requirements, an interruptable system of 
I/O organisation cannot be used. To ensure that no con- 
flicts can occur between the status of I/O ports and the 
program requirements, a software loop in each program 
synchronises the start of each program-run to the audio- 
sampling pulses. 

Each program has a similar principle of operation in 
which full advantage is taken of the factor of two relating 
the slopes of the segments in each companding law. When 
the scale factor has been determined, the word is shifted to 
a more significant position under the control of the scale 
factor and a logical aperture selects the appropriate number 
of bits for 'transmission'. This corresponds to compression 
of the input signal. Rounding is then applied, and the 
word is shifted back again to its original position of signifi- 
cance. Thus, the expansion process is completed and the 
results may be converted back to analogue form for sub- 
jective assessment. 



5. Instrumentation 



4. Description of program 

All six companding systems can be demonstrated with 
one master program using approximately 500 instructions 
and 160 words of data memory. Selection switches are 
monitored by the program so that the appropriate system 
program is executed. 



A block diagram of the complete signal-companding 
system is shown in Fig. 4. The pre- and de-emphasis is 
important in companded p. cm. systems to reduce pro- 
gramme-modulated noise. Using a characteristic according 
to CCITT Rec. J.I 7 (as shown in Fig. 5) reduces the 
audibility of programme-modulated noise to an extent 
corresponding to one bit/sample. 



Miproc 
processor 



-14 bit 
input pre-emphasis limiter ado. 



_:^ 



digital recorder 

^, „„^ _ 14 bit 



d.a.c. de-emphasis 



output 




Fig. 4 - The experimental system for comparing the effects of different companding laws 
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Fig. 5- CCITT pre-emphasis characteristic 
as used in ttie comparison tests 
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The output of the pre-emphasis network and delay- 
line (non-overshooting) limiter* is linearly coded to 14 bit 
accuracy using a high-precision converter, and is interfaced 
with the computer. The computer processes the data 
according to the program selected and, in normal opera- 
tion, the processed data is then converted back to 
analogue form and de-emphasised, so that it may be 
listened to and evaluated. 

In many applications, such as companding, it is 
useful to know how impairments may accumulate if the 
same process is repeated several times in cascade. By using 
a 14 bit digital recorder^ to store the results of inter- 
mediate processing it is possible to assess these conditions, 
even when the impairments introduced by one process are 
very small. This facility was used to assess the effects of 
cascading up to eight coders and decoders in tandem. The 
results of the subjective comparison of the companding 
systems using these techniques are described in another 
report. 



6. Conclusions 

The use of a program-controlled computer to execute 
precisely the processing involved in a number of digital 
companding systems has made it possible to compare the 
systems quickly and easily, while maintaining flexibility so 
that minor variants or widely different systems can be 
evaluated. Together with a digital recorder, the com- 
bination is capable of many other signal processing applica- 
tions particularly where a particular process may be re- 
peated several times. As faster computing elements are 
developed, these techniques will become even more bene- 
ficial in audio signal-processing research, and also in 
specific operational applications in broadcasting and re- 
cording. 
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